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Abstract —Differential Unitary Space-Time Modulation 
(DUSTM) and its earlier nondifferential counterpart, USTM, 
permit high-throughput MIMO communication entirely without 
the possession of channel state information (CSI) by either 
the transmitter or the receiver. For an isotropically random 
unitary input we obtain the exact closed-form expression 
for the probability density of the DUSTM received signal, 
which permits the straightforward Monte Carlo evaluation 
of its mutual information. We compare the performance of 
DUSTM and USTM through both numerical computations 
of mutual information and through the analysis of low- and 
high-SNR asymptotic expressions. In our comparisons the 
symbol durations of the equivalent unitary space-time signals 
are both equal to T, as are the number of receive antennas N. 
For DUSTM the number of transmit antennas is constrained by 
the scheme to be M = T / 2, while USTM has no such constraint. 
If DUSTM and USTM utilize the same number of transmit 
antennas at high SNR’s the normalized mutual information 
of the differential and the nondifferential schemes expressed 
in bits/sec/Hz are asymptotically equal, with the differential 
scheme performing somewhat better, while at low SNR’s the 
normalized mutual information of DUSTM is asymptotically 
twice the normalized mutual information of USTM. If, instead, 
USTM utilizes the optimum number of transmit antennas then 
USTM can outperform DUSTM at sufficiently low SNR’s. 

Index Terms —Non-coherent Communication, Capacity, Space- 
Time Coding, Multiple Antennas, Differential Encoding, Multi¬ 
plicative Channels. 

I. Introduction 

ONSIDERABLE volume of work has followed the pre¬ 
diction [1], [2] that the use of multiple antennas in trans¬ 
mitting and receiving signals can result to substantial increases 
in information throughput. The underlying assumptions of this 
effort have been that the receiver knows the channel through 
some training scheme and that the channel coefficients are 
statistically independent. In this case and for large signal to 
noise ratio p, the capacity is roughly 

C co h ~ min(M, N) log 2 p bits/sec/Hz (1) 

where M, N are the numbers of transmitting and receiving 
antennas. 

In a typical mobile wireless communication system the 
channel coefficients vary continuously, following a Jakes-like 
distribution. Thus one can only assume that the channel is 
approximately constant over only limited periods of time. 
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Hence, especially for large transmitting antenna numbers, 
training will require a substantial fraction of the coherence 
time of the channel and thus hamper the data throughput 
rates. To address this problem, Marzetta and Hochwald [3], 

[4] investigated the scenario where the receiver has no a-priori 
channel knowledge. In addition to the conventional additive 
Gaussian noise, this channel has also multiplicative noise, 
corresponding to the channel matrix, which is also assumed 
to be Gaussian. This is a “non-coherent” channel, as opposed 
to the additive white Gaussian noise channel with known (and 
static) “coherent” channel coefficients at the receiver. In an 
elegant group-theoretic approach, Zheng and Tse [5] found 
the capacity of this channel to scale as 

C lncoh « Af*( 1 - M*/T) log 2 p bits/sec/Hz (2) 

for large p, where M* = min(M, N, T/2) and T is the 
number of time intervals over which the channel is static. A 
similar approach was developed independently by [6]. This 
implies that for fixed T, there is no need to use more than 
M = T/2 transmitters. 

To take advantage of the constancy of the channel over 
T time intervals, [4] proposed to encode the signal using 
T x M isotropic unitary matrices. In this encoding, called 
isotropic unitary space-time modulation (USTM), a symbol 
can be spread not only over M antennas, but also over T 
time intervals. Some analytic results on the mutual information 
of USTM already exist. In particular, it has been shown that 
for T M [3] and for M < min(iV, T/2) and large p 

[5] the optimal input distribution is isotropic random unitary, 
i.e. that of USTM. Thus the asymptotic capacity is equal to 
the mutual information, as in [2], [3], [5], Recently, Hassibi 
and Marzetta [7] analytically calculated the received signal 
distribution and thus were able to numerically evaluate the 
mutual information of USTM for a variety of M, T, N and 
p , confirming some of the above asymptotic results. More 
recently, [8] generalized the received signal distribution to 
channels with spatial correlation. 

In the case of USTM it is implicitly assumed that, after T 
symbols the channel completely changes. In contrast, differen¬ 
tial phase-shift keying (DPSK) [9] has been used extensively 
to take advantage of the continuous slow-varying nature of 
the channel, without needing to perform any training. In this 
scheme, each transmitted symbol is encoded into a phase- 
difference from the previous symbol. 

In [10], [11], the concept of differential modulation was 
extended to multi-antenna systems. In this method, called 
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differential unitary space-time modulation (DUSTM), the sig¬ 
nal is encoded over M transmitting antennas and M time 
intervals using an M x M unitary matrix. In each successive 
M time intervals, the transmitter encodes the input signal by 
multiplying a M x M unitary matrix to the unitary matrix 
transmitted during the previous M time intervals over the M 
antennas and then transmits the matrix product. In turn, the 
receiver decodes the signal by comparing the received signal 
from the M antennas and M time intervals to that received 
over the previous M time intervals. Thus this scheme requires 
no training and assumes that the channel is fixed over T = 2 M 
time intervals. The technique of DUSTM can be applied to the 
mathematically identical space-frequency channel that appears 
during a single OFDM symbol interval, resulting in a vari¬ 
ation called differential unitary space-frequency modulation 
(DUSFM) [12], 

Despite its importance in practical applications [10], no 
analytic results are available regarding the mutual information 
of DUSTM and its comparison with USTM for T = 2 M. 
The main obstacle has been the difficulty in integrating over 
exponentials of unitary matrices. This is a problem that was 
tackled in the 80’s by high-energy physicists in analyzing the 
nuclear strong interactions (quantum chromodynamics). Due 
to the SU( 3) symmetry of these interactions their fluctuations 
can in certain cases be represented by unitary matrices. Thus 
to integrate them out, one needs to make use of such integrals 
of exponentials of unitary matrices. In this paper we apply 
these results derived by [13] to the context of DUSTM. 
The methodology of the proof in [13] is based on mapping 
the original problem to a diffusion problem of eigenvalues, 
which has a differential equation that can be solved. Given 
its complexity it will not be discussed at all in this paper. 
However, the interested reader is referred to [8], where some 
of us apply the method of character expansion to derive the 
same result and apply it to the capacity of Ricean MIMO 
channels. In the present paper, we get the following results: 

1) We analytically calculate the received signal distribution 
for the case of DUSTM (see section ITTTIi . 

2) Using this received signal distribution, we evaluate nu¬ 
merically Idustm, the mutual information of DUSTM 
for a variety of M, N and p, and compare it to Ijjstm, 
the mutual information of USTM setting T = 2 M. At 
low p we find that the two mutual informations for the 
same M, N, T = 2 M are nearly identical. This implies 
that the number of bits per symbol i.e. Idustm /M is 
twice Iustm/T = Iustm /2M. In contrast, at large p 
the number of bits/symbol of the two schemes approach 
each other, but with Idustm /M > Iustm/T. 

3) We compare the maximum with respect to M of 
the two mutual informations per symbol. For fixed 
M, N, T = 2 M, we find that while at large 
p we have max M * <m Idustm (M* ,N,p)/M* > 
ma x m *<m Iustm(M*,N, p,T)/T, at small p the op¬ 
posite inequality holds. 

4) We back the above numerical results by providing ex¬ 
pansions of the mutual information for both small and 
large p. 


II. Definitions 

A. Notation 

Throughout this paper we will denote the number of time- 
intervals, transmitting antennas and receiving antennas with 
T, M, N, respectively. R, K and Q will represent R = 
min (M,N), K = min (T,N) and Q = ma x(M,N) — 
min (M, N). 

In addition, we will use bold-faced upper-case letters to 
represent matrices, e.g. X, with elements given by Xij, 
bold-faced lower-case letters for column vectors, e.g. x, 
with elements Xi, and non-bold lower-case letters for scalar 
quantities. Tr{X} will represent the trace of X, while the 
superscripts T and f will indicate transpose and Hermitian 
conjugate operations. The determinant of a matrix will be 
represented by det(X) or by det(A'y). Also, I n will denote 
the n-dimensional identity matrix, while J„ will represent a 
T x T matrix with zeros in all elements other than the first n 
diagonals, which have unit value. 

The complex, circularly symmetric Gaussian distribu¬ 
tion with zero-mean and unit-variance will be denoted by 

CAf(0,l). 

The per-symbol normalized mutual information will be 
given by /, measured in bits/sec/Hz. Thus for the case of 
USTM, Iustm = Iustm/T , while for DUSTM, Idustm = 
Idustm /M. 

B. System Model 

We consider the case of single-user transmission from M 
transmit antennas to N receive antennas over a narrow-band 
block-fading channel. The channel coefficients are assumed 
to be constant over time intervals of length T, after which 
they acquire independent values, which in turn remain constant 
for the same time interval. The received T x TV-dimensional 
complex signal X can be written in terms of the T x M- 
dimensional transmitted complex signal $ as 

X=\— $H + W (3) 

V M 

where H is a M x N matrix with the channel coefficients 
from the transmitting to the receiving arrays and W is the 
T x N additive noise matrix. Both H and W are assumed to 
have elements that are independent and CAf(0, l)-distributed. 
Their instantaneous values are assumed to be unknown to 
both the transmitter and the receiver. The first term in 0 is 
normalized, so that p is the total average signal-to-noise ratio 
(SNR) transmitted from all antennas. 

C. Unitary Matrices for Isotropic and Differential USTM 

In this paper we will be dealing with unitary input distribu¬ 
tions For the case of USTM is a member of the S(M, T ) 
Stiefel manifold (see [14]) i.e. the set of all complex T x M 
matrices, such that 

= 1m (4) 

Note that it is implicitly assumed here that T > M, since only 
thus can M T-dimensional vectors be mutually orthogonal. 
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It is convenient here to introduce <F L , the T x (T — M) 
orthogonal complement of d>, i.e. with 


= I T and TV = I t- 


T-M 


$ = -i 


1JM 


V 2 L u J 

where U is an M x M unitary matrix. 
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We can therefore express p(X|T?) as 
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The mutual information between X and <1> is given by 


7(X;$)= / / dXp(X|$)log, 


( P(X\*) \ 


V P(X) / 


T?(l = 


Im 

Ot-m 


while for DUSTM it is convenient to use 


(5) 
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IM 

Im 


(14) 


so that $ = [T? $ j_] is a T x T unitary matrix with $1$ = 

= I rp_ 

For the case of DUSTM, we restrict ourselves to the U(M) 
subgroup of the S(M, 2M) Stiefel manifold, such that [10] 


( 6 ) 


which is the identity matrix of matrices of the form of 
0. Using O and through the change of variables X 
i> X, which leaves the X-integration measure unaffected, 
we completely eliminate any non-trivial TAdependence of the 
integrand of <E3. The remaining f cid>p(T>) can be easily 
integrated to give unity and thus is disregarded. This results 
to 


/(X;<&)= J dXp(X|d>o)log 2 ( P( ^ 0) 


(15) 


D. Mutual Information 

For fixed T>, X in 0 is a sum of two Gaussian matrices, 
therefore its probability density conditional on T> can be 
written as 

exp | -Tr | Xl ' 
p(X|T>) = - 


(7) 


III. Closed-form solution oFp(X) for DUSTM 
When dealing with DUSTM, it is convenient to express the 
conditional probability in terms of U, defined in 0 . Thus, if 
we express X as 

X=[XiX 2 ] T (16) 

where both Xx and X 2 have dimensions M x N, then 0 can 
be rewritten in terms of X x , X 2 and U as 


To evaluate the inverse of the matrix in the exponent we use 
0,0 to get the expression T>T?1 = T/Ja/TV. Applying this 
we get 


P(X|U) = 


exp(-^Tr{xlX 1 + Xtx 2 }) 


x exp 


( 8 ) 


7 t 2MN (1 + 2 P )MN 
P 


(17) 


1 + 2 p 


Tr |x 2 X|U + XiXjlJl | 


Combining this with CD we get 

ex P ( — TT2^Tr {xjXi 


XtX; 


p(X) = 


]) 


MN 


exp 


where 


^2MN (i _|_ 2p) 

(/3Tr |x 2 X|U + XxX^Ul}) 

P = 


1 + 2 p 

We can now use the result of [13] to get 


(18) 


(19) 


(9) 


exp 


(/3Tr |x 2 X|U + XxXlul}) 

det (y] ,1)/2 / i i (2t/j 


M—1 


n 


( 20 ) 


( 10 ) 


p(X) is the received signal probability density given by 

p(x) = J d$ P (x i$) = <p(x m (id 

where we introduced the notation (■) as the integration over 

The integration over $ in Co} can be eliminated by noting 
[7] first that 

p(X|$)=p(*tX|* 0 ) (12) 

The choice of T?o depends on the particular application. Thus, 
for the case of USTM the following expression can be used 


where yj 
l2 Y .vt 


fe =o det (y} 

for j = 1... M are the eigenvalues of 


/3 2 XxXTX 2 Xl (or the squares of the svd’s of /3X 2 X|). This 
equation is essentially the generating functional of U: Any 
moment of U can be evaluated by taking arbitrary derivatives 
with respect of elements of the matrix X 2 X j on both sides 
of E3 and subsequently setting this matrix to zero. 

The determinant in the denominator is the Vandermonde 
determinant 


(13) 


det^- 1 ) 



1 

1 

1 

yi 

2/2 

2 /M 

y\ 
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(21) 


































4 


while the determinant in the numerator can be written explic¬ 
itly as 


review here the results obtained in [7] regarding USTM. We 
start with the conditional probability p(X|$) 


det(yf 1 )/ 2 Vi(2yj /2 )) = (22) 


Wyl /2 )) 

4(2 y]i 2 ) 

y\ /2 h^) 

2/M 2 4(2y^ 2 ) 

yih{2y\ /2 ) 

VMhi^y]! 2 ) 

(M-l)/2 r / 9 l/2x 

Vi 4 M -i(2j/ 1 ) 

(M-l)/2j / 9 l/2\ 


where I n (x ) is the modified Bessel function of order n. 

One has to exercise caution in evaluating <E3 in the case 
M < N. The reason is that only R singular values of 
XiXg are non-zero. Therefore, both the determinants in the 
numerator and the denominator vanish. However, the ratio 
remains finite. Using Lemma Q in Appendix [I] we can show 
that 

(exp (/3Tr{x 2 X|U + XiX'Ut})) = (23) 

det(v' M - K+ ‘- 1 » / 2 2 M-* + ,- I ( 2 ! ,‘ /3 )) 

1 f fc! _ 1— _ _ _ L 

k=M-R QCT W ) 

where the range of the indices in the determinants are i,j = 
1 


IV. Mutual Information of DUSTM 

Using dl 71 . d 1 81 and d23b we can now express the ratio 
p(X|* 0 )/p(X) as 


log 2 

log 2 


f p(X|$ 0 ) 

V P(X) 


n M — 1 l i , ( A 

k—M — R k\ ^et \Vj 


M-R+i- 1\ 


/3Tr + X3,X!} - 2^; 


1/2 


(24) 


, , / (M-R+i-l)/2 f 1/2 

det yyj I M -R+i- i(2y/ 


log 2 e 


In the above equation we have defined I n (x) = I n {x)e x and 
we have multiplied both numerator and denominator of the 
expression inside the log with exp 2 XliLi. so that 

neither will have exponentially increasing terms for large y,;. 

To evaluate the mutual information, O needs to aver¬ 
aged over realizations of Xi, X 2 , which are generated with 
probability distribution p(X|<I?o). This corresponds to Xi, X 2 
having Gaussian correlations given by 

E[X* lia X lj0 \ = (1 + p)6 ij 6 af) (25) 

E[X* 2ia X 2j p\ = (1 + p)5 ij 5 ap (26) 

E[X* lia X 2jP \ = pSijS^ (27) 


V. Mutual Information of USTM 


P(X|$) 


exp (-Tr {XtX}) exp (aTr {Xt**t X }) 

ir TN (l+pT/M) MN ( ’ 


where X is a T x N complex matrix, $ is a T x N unitary 
matrix and 


pT 

M + pT 


(29) 


In [7] the received signal probability density was found to be 


p(X) = J d‘MX|$) (30) 

exp (-Tr {XtX}) 
ir TN (1 + pT/M) MN 
x (exp(aTr{X t $$ t X})) 

where the average over < 1 *, expressed as (• • •) was performed 
as follows: 


(exp(aTr{Xt$$tX})) (31) 

Ctm C dti f dtM 

M\ J 2 n"'J ~2T 


{—otyi - it m ) ■ ■ ■ {-ayx - K 

X - iti) 2 

l<m 

= Ctm | det F | 


M 

n 

m= 1 L 


where the constant Ctm is equal to 

(T — 1)! • • • (T — M)\ 

C ™ (M — 1)! • • ■ (0)! 

and F is a M x M Hankel matrix with entries given by 

K 


(32) 


- E 


o^Vk 


(ay k ) q Yli^k{ a Vk - ayi) 


(33) 


7(g.<»3/fc) 

r(<?) ’ 

1 , 


q> 1 

q < 0 


In the above expression, q = T— K — m — n+2, 7 ( 71 , 2 ) is the 
incomplete T function and y n , for n = 1,..., K are the non¬ 
zero eigenvalues of the N x N matrix XiX. As in the case of 
DUSTM, to numerically calculate the mutual information one 
needs to average the log-ratio log 2 (p(X|«I?o)/ / p(X)), where 
is given by d!3> . with respect to X, which has probability 
density p(X|<I>o). It is convenient to write X^X as 

XtX =(l + ?Cj x{Xx + xtx 2 (34) 


In the next section we will compare the mutual information 
of DUSTM to that of USTM. Thus, for completeness, we 


where Xi, X 2 ar e M x N, (T — M) x N complex Gaussian, 
unit-variance matrices. 
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VI. Analysis and Comparison to Isotropic USTM 
In section IVI-CI below, we present numerical results on the 
mutual information of DUSTM and compare them to corre¬ 
sponding USTM results. However, before that, it is instructive 
to analyze the asymptotic behavior of the mutual information 
in both small and large SNR regimes. As we shall see, this 
exact asymptotic analysis of both USTM and DUSTM will 
provide insight and quantitative agreement with numerical 
simulations. 


A. Low p region 

To obtain the small p behavior we expand the exponent 
in the log-ratio log 2 (p(X|«I>o)/p(X)) and integrate over the 
fields. For the DUSTM case in Appendix III- Al we obtain 

1 - 2 p+?~ ( 5 -^- 


Idustm ~ P N 


M 


log 2 


(35) 


For small small p, we see that Idustm is an increasing 
function of M. As a result, under the constraint of the channel 
being constant over T time-intervals, the optimal number of 
transmitting antennas is M opt = T /2. 

For comparison, in Appendix HUBl we calculate the mutual 
information for USTM for small p. The final result up to 0(p' i ) 
is 


IjJSTM 


Np 2 

~2M 


(T - M) 


2 pT 
3M 


p 2 N 


(1-2 p) log 2 e 



log 2 e 

(36) 


where the last equality holds for T = 2 M. We see that for 
T = 2 M, Idustm ~ 2 Iustm up t° order 0(p 3 )\ Also, for 
fixed T and N, Iustm is actually a decreasing function of 
the number of transmitting antennas M, with optimal M = 1. 
This can be seen in Fig. 0 where the optimal M at low p is 
1 . 

It is important to note that for p <C 1, the mutual information 
for both schemes scales as p 2 , rather than p as in the coherent 
case. This behavior has been pointed out by [15], [16]. Thus, 
at small SNR, the lack of knowledge of the channel becomes 
increasingly problematic. This is generally the case for unitary 
space-time modulated schemes. 


B. High p region 

In Appendix ID we obtain the large p behavior of the 
mutual information of DUSTM, which to 0(log 2 p/p) is 


Idustm 


1 

M 




log 2 P T Amn 


(37) 


where 


R { R\ 

Amn = — log 2 (47 t) - R ( M - — J log 2 (2e) 

M—l 

^2 log 2 k\ + R 

k=M-R 

+ ^R(R-l)C 2 (M,N) (38) 


M-R+-) A(M,7V) 


is a constant, independent of p. In CD we have defined 
the quantities C\{M,N) = E [log 2 Ai] and £ 2 (M,N) = 
-E[log 2 (Ai + A 2 )J, where Ai 2 are distinct non-zero singular 
values of an M x N matrix with independent CAf(0, 1) entries. 
Their explicit expressions are given in d86> . (87). 

Similarly, in Annendix llll-BI we derive the asymptotic large- 
p form of the mutual information for USTM (for T > M) 


Iustm 


+ 


— [R(T - M) log 2 p + B T mn] 



(39) 


with 


Btmn = R(T-M)(log 2 ^- + £i(M,N)} (40) 
- log C T m ~ log 2 | det G | 


with Ci(M,N) given in d86l> . The last term appears only 
for M < N and the elements of G are given in d94t . It 
is important to note that for T = M the mutual information 
vanishes to the order calculated above, since in that case the 
mutual information is identically zero. 

The leading terms, proportional to log 2 p in 03 and 03 
provide insight on the large p behavior of DUSTM and USTM. 
Starting with d37t . we find that for fixed N, the mutual 
information Idustm is an increasing function of M. Thus, 
as we found in the small p case in the previous section, 
to maximize the mutual information, one should use the 
maximum number of transmitting antennas consistent with the 
constraint that the channel is constant over 2 M time-intervals. 

In the case of USTM we find that, for T > 2 N the optimal 
transmitting antenna number is M opt = N, while in the 
opposite case T < 2 N, the leading term is optimized for 
Mopt = T/2. 

Once optimized over M, the leading terms of both <E3 
and (1391 are identical to | 3 - Thus, to leading order in p, both 
DUSTM and USTM are capacity achieving schemes. Com¬ 
paring the next-to-leading /^-independent terms in 03, 
we find that, after optimizing over M, the mutual information 
of DUSTM is larger than that of USTM. This can be seen 
in Fig.0 where the optimized-over-M Idustm and Iustm 
of 1371 and d39> are plotted (dashed lines). This may come 
as a surprise if one takes into account that for T = 2 M, the 
manifold of constellations used for DUSTM © is a subgroup 
of those used in USTM. However, one should take into account 
that in DUSTM, although information is sent over M time- 
intervals, the receiver exploits the side information that the 
channel has not changed over the previous M time-intervals. 


C. Numerical Simulations 

We now discuss the numerical simulations performed to 
evaluate the mutual information for USTM and DUSTM. The 
simulation procedure consists of the following steps: First we 
generate L instances of Gaussian complex random matrices 
with covariance given by m and (03 for the DUSTM and 
USTM cases. For each matrix instantiation we calculate the 
singular values and then we apply them to evaluate the log- 
ratio log 2 (p(X|$)/p(X)), which we then average over its 
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L values. For intermediate and large p we have found that 
L ft 4 — 5 ■ 10 4 are sufficient. However, for smaller p, 
at least L = 5 ■ 10 5 are required. The reason is that the 
mutual information, being 0(p 2 ), is quite small and therefore 
fluctuations have a more pronounced effect. 

In Fig. HI we compare the numerically evaluated mutual 
information of USTM and DUSTM for low, intermediate and 
relatively large SNR values. We find that for small p = —6 dB 
the normalized mutual information I dustm is nearly exactly 
twice Iustm- This is in agreement with <E} and CD- Even 
for intermediate SNR, p = 6 dB we find the approximate rela¬ 
tion Iustm(T = 2M, M, 2N) ft Idustm(T = 2 
This approximation breaks down for larger p. 

Motivated by these ratio dependencies and scaling relations, 
in Fig.0we analyze the dependence of ratios of Idustm and 
Iustm on SNR . In Fig. 0a) we plot the ratio Idustm{T = 
2M,M,N = rM)/i UST M(T = 2M,M,N = rM) as a 
function of p for various values of M and for r = 1/2, r = 
1 and r = 2. We find that for fixed r, the ratios fall close 
(but not on top) to each other. Their value starts from very 
close to 2, for small p and in accordance with C3, CD, and 
approaches 2(1 — 0.5 min(l, r)), in agreement with ( 137b . (139b . 
We note however the slow convergence to their asymptotic 
values for large p , which can be explained by the fact that 
both mutual informations increase only logarithmically with 
p. The closeness of the curves for fixed r indicates that the 
ratio has weak dependence on the actual values of T, M, N. 
Thus a large-T, M, N analysis is expected to give good results 
even for small antenna numbers. 

In Fig. 0b) we plot the ratios Idustm (T = 2 M, M, N = 
rM) / (MIdustm(2, 1, N = 1) as a function of p for various 
values of M and r. 

In Fig.0we analyze the mutual information of DUSTM and 
USTM optimized over the number of transmitting antennas M 
with T fixed to T = 8 and for various values of N. In Figs 
0a), (b) we plot the capacity of DUSTM and USTM defined 
as 


by simply maximizing the log p term in 0 [5] and in CD. 
actually becomes optimal at very large p ~ 50 dB. 

Turning now to Fig. 0a) we see that at relatively small 
SNR, Cjjstm and Cdustm actually cross each other. At 
high SNR, DUSTM consistently performs better than USTM. 
At low SNR, USTM, when optimized over M performs better 
than DUSTM. This can be explained by looking at the leading 
term of (136b : the optimal M is M opt = 1 and Iustm (T, 1, N) 
can be higher than Idustm(T,TN). Interestingly, the 
analytic estimates at low SNR do not match very accurately 
to the behavior at p ft —6 dB. 

VII. Conclusions 

In conclusion, we have found a closed-form expression for 
the probability density of the received signal for differential 
unitary space-time modulated (DUSTM) signals. This allowed 
us to evaluate numerically the corresponding mutual infor¬ 
mation. In addition, we calculated analytically the asymptotic 
form of the mutual information for DUSTM and USTM for 
small and large SNR’s. At low SNR’s the nondifferential 
form of USTM can outperform the differential form if the 
number of transmit antennas is optimized. However, at high 
enough SNR’s the differential USTM outperforms its nondif¬ 
ferential counterpart with respect to mutual information. An 
additional advantage of DUSTM over USTM is its simplicity 
of decoding, though recent progress has been reported for 
decoding of nondifferential USTM [17]. This suggests that 
DUSTM is a promising type of transmission for non-coherent 
MIMO channels. It would be interesting to test the competitive 
advantage of differential USTM in cases when T > 2M, for 
example when T is a higher multiple of 2 M. In that case the 
successive use of differential USTM could be assessed. 
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Cdustm = max Idustm(T* = 2M*, M*, N\A\) 

M*<T/2 

= Idustm{T,T/2, N) 

Cu stm = max-IusTM(T,M*,N) (42) 

as a function of p (solid curves). In Fig. 0c) the solid 
curves depict the optimal number of M that maximizes 

Iustm{T, M, N) 

M opt = arg max Iustm (T,M*,N) (43) 

as a function of p. As seen in (14 1 b . the optimal M for 
DUSTM is always equal to M = T/2, consistent with both 
low and large p analysis. In Figs. 0b) ,(c) the dashed curves 
represent the capacity and optimal M values as evaluated 
using the large-/-; asymptotic expressions of 03. 03- Very 
good agreement with the exact values (solid curves) can be 
seen down to moderate SNR. However, one should note, that 
even though ( 139b describes the capacity accurately down to 
moderate SNR, the large-/; optimal value of M as predicted 


Appendix I 


Lemma 1: Let fj(xi) represent the i,j -th element of a 
T x T dimensional matrix. Here fj(x) for j = 1 ,...,T is 
a family of analytic functions and {xt} is a T-plet of real 
numbers. For simplicity we represent this matrix in terms of 
its columns denoted by f(ar*) = [fi{xi) h(xi) ... f T (xi)] T . 
Also we denote by A({cCj}) the Vandermonde determinant of 
the Xj’s 

A({zi}) = det(a;^ _1 ' ) ) = - Xi) (44) 

j>i 


Thus, in the limit that a subset of k members of the T-plet 
are equal with each other (i.e. x\ = ... = Xk, for k < T), 
then the ratio of det fi(xj)/A({xi}) exists and is equal to 


lim 

4Xi i— 2,. 


det [f(xi) f(z 2 ) ■ ■ ■ f(zr)] 

n,-=i Ui 


Lj=l X j ) 

det [f(cci)f (1) (a;i) ... f (fe 1) (xi) f(zfe+i) ... 




n«>j, j=k+i( xi x j ) rim=fc+i( a '' 


m=fc+ 


(45) 
I( X T )] 

- Xi) k 
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Mutual Information for DUSTM and USTM for various N; T = 2M; SNR = -6 dB 



Mutual Information for DUSTM and USTM for various N; T = 2M; SNR = 6 dB 



(a) p = —6.0 dB 


(b) p = 6.0 dB 


Mutual Information for DUSTM and USTM for various N; T = 2M; SNR = 18 dB 



(c) p = 18.0 dB 

Fig. 1. Plot of normalized mutual information Idustm (solid lines) and Ijjstm (dashed lines) as a function of the number of transmit antennas M for 
different receive antenna numbers N and three SNR levels p. The coherence interval T is chosen to be T = 2 M for proper comparison. 


where f( n )(a;) denotes the n-th derivative of each of the 
elements of the vector f (x) evaluated at x. 

Proof: This can be proved by successively applying the 
F Hospital limit k — 1 times on the numerator and denominator 
of GD- For the pth application of this rule (p < k ) we 
calculate the limit of x v+ \ —> X\. For example, if k = 2, both 
numerator and denominator in g5) have a simple zero in the 
limit x 2 —■> X\ . Therefore, by taking a single derivative of both 
and setting x 2 = X\ in the result gives the correct answer. For 
k = 3, we first take the x 2 —> X\ limit as above and then we 


take the limit x$ —> x\. Now both top and bottom expressions 
of the ratio in d45b go to zero quadratically in (xy x-|). Hence 
one has to take the second derivative with respect to X 3 on 
both top and bottom expressions. For a full proof see [8] ■ 

Appendix II 
Small p Analysis 

In this section we will calculate the first four terms in the 
Taylor expansion in p of the mutual information for both 
the differential and the isotropic USTM cases. The mutual 
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Ratio of Normalized Mutual Information of DU STM to that of USTM versus SNR for various M, N Ratio I dustm (T= 2M ’ M ’ N)/(m2 'dustm* 2,1 - 1 W for various M - N versus SNR 




(a) I DU st{T — 2M, M, N ')/ 1jjjst(T = 2AT, M , AT) 


(b) Idust(T = 2M, M, AT)/( m1^ust{T = 2,1,1)) 


Fig. 2. (a) Plot of the ratio between the mutual information per symbol for the Differential USTM and t he I sotropic USTM (with T = 2 M) as a function 

of SNR. For low SNR, the ratio approaches 2, as seen in the previous figure a nd in agre ement with on m- For large SNR the ratio appears to approach 
1 (for M = N and M = N/2) and 1.5 (for M = 2AT), as predicted from <371 . (39}. For M = N and M = 2N, the ratio appeal's not to depend on 
the number of antennas for any intermediate p. (b) Plot of ratio between the mutual information per symbol for the Differential USTM for various antenna 
numbers to that for M = N = 1. 


information /(X; 4>), dl5> can be rewritten as 

/(X; 4>) = J dXp(X |d> 0 ) log 2 < ^^° ) ) > (46) 

where (•) denotes average over 4*. The simplest way to 
proceed is to expand both logarithms in powers of p and, 
where convenient, interchange the integration over X and 4>. 
The expectation over X will be denoted by E[-]. 


A. Differential USTM 

In the case of DUSTM, we see that by taking the ratio 
of p(X|U = I m) in (I17> and p{X) in d 1 8t the mutual 
information can be written as 


I = (3E 
- E 


Tr jx 2 X{ + XiXj 
log 2 (exp (>r{x 2 Xl 


}] !og 2 


(47) 


U- 


XrXjut}); 


Since Xi, X 2 are zero-mean Gaussian quantities, we only 
need to specify their variances given by C2}. As a result, the 
first term in can be easily evaluated to give 


PE 


{x 2 Xl + XxX^}] = 2 ppMN (48) 

To deal with the second term in g7}, we also need the 
following identities for the averages over U. 


( UijUn.) — ^jSuSjk 

2d+l 

(Uu igjg ) = 0 

9=1 


(49) 

(50) 


where r = 0,1, • • •. Note that, since U is an element of U (M), 
all odd moments vanish. However, even moments other than 
the second one are not easy to evaluate. In fact, even using 
the simple-looking form of CD does not simplify matters too 
much. 

To expand the exponent of the second term in <|47} in powers 
of (3 we use the notation 


A n = (TrjXaXlU + XxX^})” (51) 

We see that due to ( I5UI . all odd terms vanish, (A 2r+ 1) = 0. 
Thus, to A th order in 3, m can be written as 

I = (2 p p MN-^-E[(A 2 )} (52) 

- ^(El(A 4 )}~3E[(A 2 ) 2 ])y og2 e 

From we get 

< A 2 >=-|Tr{x 2 X t 1 X 1 xt} (53) 

which results to E [(A 2 )] = p 2 MN 2 + (1 + p) 2 MN 2 . Since 
3 = p/(l + 2 p) is O(p) for small p , we only need to evaluate 
the averages involving A 4 and A 2 to leading order in p, i.e. to 
0(1). Thus, we may neglect the O(p) terms in the correlations 
between Xi and X 2 (see (ED). As a result. 


E [(A 2 ) 2 ] 
E((A 4 )} 


4N 2 (1 + M 2 ) + O(p) 

(E [A 4 ]) (54) 

127V(l + MV)Tr{(UU t UU t )} + O(p) 
12MN(l + MN)+0(p) 
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Capacity of DUSTM and USTM (Low SNR region); Mutual Information of USTM maximized ovt 



SNR (dB) 


(a) Capacity-small p region 


Capacity of DUSTM and USTM; Mutual Information of USTM maximized over M 



(b) Capacity-full region 


Optimal Number of Transmit Antennas as a function of SNR for USTM 



(c) Optimal number of M for USTM 


Fig. 3. Plot of capacity of DUSTM and USTM as a function of p. Here T is fixed to T = 8. Fig.|3ja) depicts the small-p region, while Fig. |3jb) shows the 
full region of p-values simulated. Fig. 0C) represents the optimal number of transmitting antennas f or U S TM as a function of SNR and for various values of 
N. The dashed curves in Fig. 0b) ,(c) correspond to the analytic expressions valid for large p (see |37}, Eft- 


Collecting all terms from above and expanding them to 0(p x ) 
we obtain the mutual information of CSJ. 


B. Isotropic USTM 

In the case of USTM, we will expand Iustm to order p 3 . 
Here, the analog of g7} is 

I = aE [Tr (X f J M X}] log 2 e (55) 

— E [log 2 (exp (aTr {X^UJ M U f X}))] 


from d34l 


E [X* 0 X kl ] = SikSji (1 + Jn (56) 

As a result, the first term in (|55} can be easily evaluated to 
aE [Tr {X f J M X}] = MNa/{ 1 - a) = TNp (57) 
Similarly to the previous section, we define B n as 


where U is a T x T unitary matrix and the T x N Gaussian 
random matrix X has the following correlations, which follow 


B n = (Tr{XtUJ M UtX}) n 


(58) 
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Then, after expanding the second term in i !55t to order a 3 , I the independent M x N matrices Z± with CM (0,1) entries, 
becomes defined as 


I = log 2 e[MN-^--aE[{B 1 )] (59) 

\ 1 — a 

cy 2 

- Y (E[(B 2 )}-E[(B i y-]) 

y {E [(B 3 )] + 2 E [(Si) 3 ] - 3 E [(Bi)(B 2 )])) 

Using the orthogonality relation for U(T) unitary matrices 

{UM) = i s u 8 jk (60) 

we can calculate (B\) to be 


Z+ 

z 


X!+X 2 

^2(1 + 2 p) 

Xi -X 2 

V2 


( 66 ) 


Thus /f 2 X 2 X|XiX2 can be written as a sum of terms with 
decreasing powers of p : 


/3 2 X 1 Xtx 2 X[ 


where 



-Hi 

Vp 




(67) 


<Bi >=^Tr{Xtx} 

We can now calculate the terms in E3 explicitly: 

E[(B 1 )] = MN (l + TT^) 


EKB^-EKB,) 2 ] = 


= MN 



{E [B 2 ]} — E [(Bi) 2 ] 
+ 2 p—+ ®(P 2 ) 


(61) 


(62) 

(63) 


E [<B 3 >] + 2 E [(Bi) 3 ] - 3 E i(B 1 )(B 2 }] (64) 

= 2 MN (l - f-) (l - 2 p^pj + 0(p ) 

Note that the last two equations were only calculated to O(p) 
and 0(1), given that their proportionality constants in < !59b are 
0(p 2 ) and 0(p 3 ), respectively. Collecting all terms (I62l >. ( I63> 
and ( 1641 ) together in (|5l, we get the mutual information for 
USTM to 0(p 3 ) expressed in ( I36> . 


Appendix III 
Large p Analysis 

A. Differential USTM 

We wish to calculate the asymptotic behavior of the 
DUSTM mutual information for large p. Using EH}, we 
rewrite the log-ratio of El} as 


log 2 ( ^ ) = (ippMN - 2 Vthj log 2 e 

M—l 

^2 lo &(65) 

k=M-R 


+ log 2 


det (y. 


det 


M-R+i-1 

Vj 


'j 


lM-R+i-l(2y/yj 


where t/j, for i = 1, • ■ ■ , R are the R eigenvalues of the 
matrix /T 2 X 2 X jXiX^. The above equation is averaged over 
the M x N Gaussian matrices Xi, X 2 with correlations given 
by E3- To analyze the large p behavior, it is convenient to use 


n 0 

= Af+ 


(68) 

Hi 


^(z_z\_ - Z+zVj Af+ + h.c^j 

(69) 

n 2 

= K 

-Eat + a^_+a^_aa + ) 

(70) 


l(z_zi-z + z!) 2 


and A f± = Z±Zj_. 

To leading order in p, we can neglect the higher order 
terms in d67l and only keep the term proportional to 7L 2 . 
In this case, the eigenvalues of the left hand side of E3 
are yi = (/3pAj) 2 , where \ are the eigenvalues of A f +. We 
will need to calculate j/j to next to leading order, focusing on 
the R non-zero ones. To do this we need to express the full 
eigenvalues yi as well as their corresponding eigenvectors as a 
Taylor expansion in the small parameter 1 / yfp. Applying the 
normalization condition of the eigenvectors at every order we 
obtain an expression for the corrections of the eigenvalues in 
terms of the eigenvalues and eigenvectors of the unperturbed 
matrix, i.e. Tt 2 . The perturbation analysis of eigenvalues is 
treated in detail in standard textbooks, see for example [18]. 
Below we simply quote the answer: 


v, = /jy[A? + d^ + d^ (71) 




p 


+ -E 

p 


IntWinj 

, \ 

X 2 - A 2 

-+0(p~ 3 / 2 ) 

1 3 

/ 


where n; are the eigenvectors corresponding to A,. The last 
term in the above equation is summed over all A j, including 
zeros, and is well behaved because the eigenvalues A, are 
unequal with probability 1. We next observe that, since n, 
are eigenvectors of 7i 0 , n|?T i n, = 0. We now can expand 
the second term in E3: 


R 


2EV^ = 2 (3p^2 


11)7^211, 

Ai ~r 


2p\ 
2 


(72) 


+ 


-E- 


n, Ti i iLj 




o{p- 2 ) 
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To proceed further, we integrate out Z in the above equation 
(but not Z + ). As a result we get 


2 E] ^ = 2 @P Xi 

i= 1 i 

+ /?^A. i + (M-N)Rj 

which, after integrating over Z + gives 


(73) 


0{P~ 1 ) 


2 E 


r r 




2 (3pMN (74) 

R ( M ~§) +v(p~ 1 ) 


Since the eigenvalues of 'Hq are equivalent, we need only 
to evaluate the averages E [log 2 Ai] and E [log 2 (Ai + A2)] 
over the M x N Gaussian matrix Z + . Careful analysis of 
the correction term shows that it is 0(log 2 p/p). 

To calculate these quantities we need the single eigenvalue 
probability density p(A) as well as the joint two eigenvalue 
probability density p(Ai,A2) for the random matrix Hq = 
Z\_ Z + . Using Telatar’s analysis [2], it can be shown that 


P( A) = 
p(Ai,A 2 ) = 


x 


X Q e A 
R 


p 2 (A, A) 


A? A^e“ (Al+A2) 


R{R - 1) 


(79) 

(80) 


(p2(Ai, Ai)p 2 (A 2 , A 2 ) — p 2 (Ai, A 2 )“) 


Thus, the first term in the above equation, cancels the first 
O(p) term in d65l . with the remainder being only of order 
unity. 

We now turn to the asymptotic treatment of the determinants 
in m- Since for large p the non-zero tp’ s will be large, 
we may use the asymptotic form of the normalized modified 
Bessel function 


In (*) = e~ x I n (x) 

« -i(l + 0(x~ 1 )) 

V Z 7 TX 

in the determinant of the denominator in to obtain 

M-R+i-1 


det 


Vi 


= det 


V An VyJ 

’ {(3 p\j) M ~ R+i ~ l 


(1 + 0(y- 1/2 )) 


\ y/toppXj 

(/3p) {2M_ ' R_2)ii/2 
7 r/2 


-(i + oip- 1 )) 


(75) 


( 47r ) ij<i 

x [l + ^p- 1 )] 


n - a.) 


176) 


where p 2 (Ai, A 2 ) is given by 


p 2 (Ai, A 2 ) - ^2 ( k + qy L k ( Al ) L fc l 81 ) 

and L k (*) is the associated Laguerre polynomial of order k. 
Since both p(A) and p(Ai,A 2 ) are finite polynomials in Ai, 
A 2 times a exponential factor, they can be explicitly integrated 
using the following identities several times: 



= n!m!T'(n + m + 2) log 2 e 


The first equality follows from the fact that tp = /? 2 p 2 (A 2 + 
0(p~ 1 ). Similarly, the Vandermonde determinant can be ex¬ 
pressed as 


where C is the Euler constant C = 0.57721 • • •. To somewhat 
simplify the procedure, we apply the Christofel-Darboux iden¬ 
tity (see [19]) 


= det([UVA, 2 (l + 0 („- 1 ))] M -' ,+ ‘-') 

= (ft,)"' 2 "-*- 1 ' n A 2 (M - R, (A 2 -A 2 ) M 2 ( Ai ,A 2 ) = 


i,j<i 


x [l + Oip- 1 )] 


(77) 


Taking the logarithm of the ratio of the two determinants < 1761 . 
f77). we get 


log; 


det(- • •) 
det(- • •) 


R 


p R 


= R \ M - -77 lo S T + 77 lo §2 (78) 


which, in the limit 


E f84 > 

R\ 

(R + Q-l)\ 

^- 1 (A 1 )^g(A 2 )-£g(A 2 )Lg_ 1 (A 1 ) 

Ai — A 2 

A 2 —> Ai becomes 


+ (m-R+^J2 l°g 2 A* 

+ log 2 (Ai + Aj) + log 2 (l + Q(p 1 )) 

i,j<i 


p 2 (Ai, Ai) 


R\ 

(R + Q — 1 )! 
(Lg_ 1 (A 1 )L«li(A 1 ) 


(85) 




Q+i 
R -2 



X 
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Combining {75J, 4HH> and we get 

Ci{M,N) = E [log A] = 

R -1 

E (- 1 ) fc+m 


(R-l)! 

(A + Q-l)! 


( 86 ) 


k,m —0 
R,R-2 


Q + R-l 
R-l-k 


Q + R 
R — 1 — m 


Q + R 
R-k 


- e (-!) fc+m 

k,m =0 

{Q + k + m)\'i'(Q + k + m + 1) 


Q + R-l 
R — 2 — m 


k\m\ 


log 2 



'y ' ^_ ^k 2 +m 2 

fc 2 ,m 2 =0 

R,R-2 

_ 'y ' ^_^^fe 2 +m 2 

fc 2 ,m 2 =0 


Q + R - l 
R -1 - fc 2 


Q + R 
R — k 2 


Q + R- 1 
R — 2 — TO2 


(Q + A:i + mi)!(<3 + k 2 + m 2 )\ 

k\\k 2 \m\\m 2 \ 

x >]/(2Q + k\ + mi + fc 2 + to 2 + 2) log 2 e 

R-1,R 


fci ,mi=0 


Q H - — 1 

r - i - &i 


Q + R 

R — TOi 


^ ' ^_^^fc 2 +m 2 

k 2 ,m 2 =0 


sgn(ki — mi) 
k\\m\\ 


Q + R- 1 
R - 1 - k 2 


sgn(k 2 — m 2 ) 


|fci—mi|—1 

E 

Pi—0 

Q + R 
R — m 2 J 

|/c2 —m 2 1 — 1 


E 

P2=0 


k 2 \m 2 \ 

x (Q + max(fci, mi) — 1 — p! + max(fc 2 , m 2 ) - 1 — p 2 )\ 
x (Q + min(fc 1 , mi) + min(fc 2 , m 2 ))\ 


B. Isotropic USTM 

To analyze the large p behavior of mutual information of 
USTM, we start by writing the mutual information as 

P(X|* 0 ) 


luSTM = E 


log 2 


( 88 ) 


In the above equation the last term, which appears outside the 
bracket, refers to both double sum-terms inside the bracket. 
Similarly, by combining < 18II . (1831 and (1841 , we get 

C 2 (M,N)=E[log 2 (A!+A 2 )] = 

Q + R 
R — 1 — mi 

Q + R- 1 
R— 2 — mi 

Q + R 
R — 1 — m 2 


P(X) 

= olE [TrjX^MX}] log 2 e 
- E [log 2 (exp (aTr {X^^X}))] 

= TNp\og 2 e-E [log 2 (exp(aTr{X t ** 1 'X}))] 

where the third equality is obtained by integrating over X, 
see (EH. To evaluate the second term we will perform an 
asymptotic analysis of the multiple integration in ED. which 
is performed by evaluating the residues of the poles of the 
t-integrals. We will assume that T > M, since otherwise the 
mutual information is identically zero. We also use the fact 
that at large p from El- the eigenvalues of X^X generally 
split into three groups: the first R being large 0(p~ 1 ), K — R 
eigenvalues being 0(1), while the remaining N — K being 
zero. For simplicity, we assume they are ordered in magnitude, 
i- e - 2/i > 2/2 > • ■ ■■ Note first that the last term in ED 
guarantees that no two ti s are evaluated at the residue of the 
same pole with y n / 0. As a result the leading term will entail 
min (A', M) t’s evaluated at the poles of the 0{p) eigenvalues 
of X^X. All other terms will be exponentially smaller. Let us 
start with the simpler case of M < K. Here the M /-integrals 
are all performed by taking their residues at the Ad O(p) yt s. 
Thus we get 


(exp(aTr{X t $$ t X})) 

M 

C TM 


(89) 


m=illf=i, q ?m( a y™-ay q )(ay m ) T K 
(ayi - at/m) 2 


l<m 


M 


= Ctm E 


0 Otym 


nf=m +1 {ay m - ay q )(ay m ) 

1 (1 + 0 (p~ 1 )) 


G 


TM 


T-K 


M 

TT 


ii 

m =1 

.(<* 2 lm) T ~ M _ 


where in the last step we used the fact that the eigenvalues 
2 /m+i, • • •, yx are 0(1), while y m for m = 1,..., M are 
y m = O(p). Thus for Ad < K the mutual information can be 
written as 


Iustm = 


M 


T Np — a E E [y n 


m =1 
M 


log 2 e - log 2 Ct,m (90) 


+{T - Ad) E E [log 2 (ay m )\ + 0(log 2 p/p ) 


m= 1 


x^(2 Q + h+ mi + k 2 + m 2 ) log 2 e (87fhat for Ad < K 


Using a similar analysis as in Appendix lIH-AI it can be shown 


M 


aJ2 E [2/m] = TNp + (T - Ad)R + O (p’ 1 ) 


(91) 


As before, the terms outside the curly brackets are common to 
all sums inside the brackets preceding them. After collecting m=1 

all terms we can now evaluate the DUSTM mutual information To calculate the expectation of log 2 ay m , we note that to 
to order 0(log 2 p/p). leading order we have y m « pTA m /M + 0(1), where A m 
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are the eigenvalues of XjXi, with Xi a M x N Gaussian 
random, unit-variance matrix. Thus we can use (1861 . 

When M > K = TV, we have the added complexity that 
only I\ yn s are O(p). After performing the first K ^-integrals 
by evaluating them at the poles of these K O(p) y's, (13 1 1 
becomes 


(exp(aTr{X t $$ t X})) 

K 


(92) 


Ctm 


n 


p^Vm 


(M-K)\ 11 (ayro )T-x 

dt m e~ iXm nf=i(-ay<j - itm) 


27r 


n 

m=K+ 1 1 

jPJ ( ^A m *Aj 


( ^A m) 


T-K 


l>m 


The M — K remaining integrals have high-order poles at zero. 
It is straightforward to show that the above equation becomes 


(exp (oTrjX^^X})) 


(93) 


K 

~ Ctm JJ 

m= 1 


gOt-Um 

{ay m ) T ~ M 


|det G| (1 + 0(p~ 1 )) 


where G is an (M — Jv)-dimensional square Hankel matrix 
with elements 


G 


mn — 


(T — K — m — n+1)! 

0 


m + n <T — K + 1 
otherwise 


(94) 


As a result, for M > N and large p the mutual information 
is asymptotically equal to 


luSTM 


(T - M)R 


l pT . r 

‘° g! Ml + C ' 


log 2 |det G| + 0 ( log 2 p/p) 


log 2 Ct,m 

(95) 
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